Control system and method for rack mounted computer units

ABSTRACT

The method includes receiving a command signal by a reset control module from another source such as a remote computer through a network. The command signal includes instructions to manipulate or control a computer unit such as a computer, computer component or computer system. An execution signal is transmitted for manipulation or control of the computer, computer component or computer system. The controlling or manipulation includes resetting, powering on or powering off the computer, computer component or computer system.

RELATED APPLICATION

[0001] This application claims priority to U.S. provisional patentapplication Serial No. 60/413,922, titled REMOTE RESET SYSTEM AND METHODFOR COMPUTER COMPONENTS AND SYSTEMS, filed Sep. 25, 2002, which ishereby incorporated by reference in its entirety. Additionally, priorityis claimed to U.S. non-provisional patent applications Ser. No.10/449,799, filed May 29, 2003, titled “Rack Mountable ComputerComponent and Method of Making Same”; Ser. No. 10/448,691, filed May 29,2003, titled “Rack Mountable Computer Component Cooling Method andDevice”; Ser. No. 10/449,608, filed May 29, 2003, titled “Rack MountableComputer Component For Cooling Arrangement and Method; and Ser. No.10/448,508, filed May 29, 2003, titled “Rack Mountable ComputerComponent Power Distribution Unit and Method”.

[0002] This application is related to U.S. patent application Ser. No.10/160,526, titled “Method and Apparatus for Rack Mounting ComputerComponents,” filed May 31, 2002, U.S. Provisional Application Serial No.60/384,996, titled “Rack Mountable Computer Component and Method ofMaking Same,” filed May 31, 2002; U.S. Provisional Application SerialNo. 60/384,987, titled “Rack Mountable Computer Component Cooling Methodand Device,” filed May 31, 2002; U.S. Provisional Application Serial No.60/384,986, titled “Rack Mountable Computer Component Fan CoolingArrangement and Method,” and U.S. Provisional Application Serial No.60/385,005, titled “Rack Mountable Computer Component Power DistributionUnit and Method,” filed May 31, 2002, which are each hereby incorporatedby reference in their entirety.

BACKGROUND OF THE INVENTION

[0003] 1. Field of the Invention

[0004] The present invention relates in general to computer components.It more particularly relates to a system and method for remotemonitoring and resetting of computer components or systems.

[0005] 2. Related Art

[0006] There have been a variety of different types and kinds of methodsand systems for mounting computer components. For example, reference maybe made to the following United States patents: PATENT NO. INVENTORISSUE DATE 4,258,967 Boudreau Mar. 31, 1081 4,879,634 Storrow et al.Nov. 07, 1989 4,977,532 Borkowicz et al. Dec. 11, 1990 5,010,444 Storrowet al. Apr. 23, 1991 5,216,579 Basara et al. Jun. 01, 1993 5,460,441Hastings et al. Oct. 24, 1995 5,571,256 Good et al. Nov. 05, 19965,684,671 Hobbs et al. Nov. 04, 1997 5,877,938 Hobbs et al. Mar. 02,1999 5,896,273 Varghese et al. Apr. 30, 1999 6,025,989 Ayd et al. Feb.15, 2000 6,058,025 Ecker et al. May 02, 2000 6,075,698 Hogan et al. Jun.13, 2000 6,220,456 B1 Jensen et al. Apr. 24, 2001 6,305,556 B1 MayerOct. 23, 2001 6,315,249 B1 Jensen et al. Nov. 13, 2001 6,325,636 B1 Hippet al. Dec. 04, 2001 Re. 35,915 Hastings et al. Oct. 06, 1998 Des.407,358 Belanger et al. Mar. 30, 1999

[0007] Computer systems such as networks, rack mounted computerclusters, or mainframes may generally be provided with a control that isphysically mounted on each unit for resetting the unit. In this regard,the performance of one or more units may be monitored remotely to detectpossible malfunctions. Such malfunctions are typically resolved simplyby shutting down and restarting, or resetting, the unit.

[0008] In a typical environment, many units may be monitored remotelywhile the units are located in another room, another building or evenanother city. When a malfunction is detected at the remote location, auser must physically travel to the location of the unit, determine whichof typically many physical units corresponds to the detectedmalfunction, and physically shut down and restart the unit. The user maythen return to the remote monitoring location. If the malfunctionre-occurs, the user must repeat the process, including the travelling tothe location of the unit.

BRIEF DESCRIPTION OF THE DRAWINGS

[0009] The following is a brief description of the drawings:

[0010]FIG. 1 is a pictorial view of a rack-mounted computer systemshowing the front, left side and top thereof, which may use anembodiment of the present invention;

[0011]FIG. 2 is a pictorial view of a housing of the system of FIG. 1,illustrating the process of installation of computer blades;

[0012]FIG. 3 is an enlarged scale top view of one embodiment of a bladeof the rack-mounted system of FIG. 1;

[0013]FIG. 4 is a left side elevational view of the blade of FIG. 3;

[0014]FIG. 5 is a block diagram of the system of FIG. 1, illustratingthe control arrangement according to one embodiment of the presentinvention;

[0015]FIG. 6 is a block diagram of a reset control module for thecontrol arrangement of FIG. 5;

[0016]FIG. 7 is a schematic diagram of an optical isolator arrangementof the reset control module of FIG. 6; and

[0017]FIG. 8 is a schematic diagram of an alternative embodiment of anoptical isolator arrangement of the reset control module of FIG. 6.

DESCRIPTION OF CERTAIN EMBODIMENTS OF THE INVENTION

[0018] The following detailed description is organized according to thefollowing outline:

[0019] A) General Overview

[0020] B) General System Description

[0021] C) Reset Control System

[0022] D) Reset Control Hardware

[0023] E) Reset Control Software Method

[0024] According to at least one of the disclosed embodiments of thepresent invention, there is provided a system and a method forcontrolling a computer unit such as a computer, a computer component ora computer system.

[0025] The method includes receiving a command signal by a reset controlmodule from another source such as a remote computer through a network.The command signal includes instructions to manipulate or control acomputer unit such as a computer, computer component or computer system.An execution signal is transmitted for manipulation or control of thecomputer, computer component or computer system. The controlling ormanipulation includes resetting, powering on or powering off thecomputer, computer component or computer system.

[0026] The method may also include receiving a data signal from at leastone of a sensor and the computer, computer component or computer system.The data signal may include information relating to operation of thecomputer unit such as a computer, computer component or computer system.A notification signal may be transmitted by the reset control module forreceipt by the remote computer, the notification signal beingtransmitted through the computer network and including the informationrelating to operation of the computer, computer component or computersystem.

[0027] In a preferred embodiment, the data signal includes an operatingtemperature of the computer, computer component or computer system. Theinformation may include an alarm indicating the operating temperatureexceeding a high temperature threshold or dropping below a lowtemperature threshold.

[0028] The remote computer may be a personal computer or a workstation.In one embodiment, the computer network is a public network, such as theInternet. In another embodiment, the computer network is an intranet ora local area network.

[0029] A disclosed embodiment of the system includes a reset controlmodule, one or more computer units such, for example, as computers,computer components or computer systems adapted to communicate with thereset control module. The reset control module is adapted to transmitinformation relating to operation of the computer units to a computer orother terminal such as a remote personal computer and to receiveinstructions therefrom for control or manipulation of the computer unit.The remote computer is adapted to communicate with the reset controlmodule through a computer network.

[0030] B. General System Description

[0031] Referring now to the drawings, and more particularly FIGS. 1-4,there is illustrated one embodiment of a vertical computer unit rackmounted system 10, which may be used with the diagnostic assembly of anembodiment of the invention as hereinafter described in greater detail.It should be understood that different rack mounted systems and othertypes and kinds of systems may also be employed, such, for example, ashorizontal rack mounted computer units. The rack mounted system 10includes a rack housing 12 configured generally as a rectangular boxhaving a plurality of vertically spaced-apart bays 14. The embodimentillustrated in the drawings includes three vertically spaced-apart bays14.

[0032] Each bay 14 is divided into a front bay portion 16 and a rear bayportion 18 by an intermediate transversely-extending horizontal divider19. The bays 14 are formed in the rack housing 12 in a verticallyspaced-apart manner one above the other. In a bottom portion of the rackhousing 12, a control bay 21 is provided to house various controlcomponents for controlling various computer units of the system 10according to the disclosed embodiments of the present invention, ashereinafter described in greater detail.

[0033] The rack housing 12 further includes a fan/LAN tray slot 23 aboveeach bay 14. Each fan/LAN tray slot is configured to accommodate afan/LAN tray such as tray 27.

[0034] The system illustrated in the drawings provides a control bay 21used for controlling the system 10 as hereinafter described in greaterdetail. The control bay 21 has a bottom opening 25 for facilitating airflow to receive vertically moving air flow from a vent opening 26 in afloor 28 and vertically through the system 10 as assisted by the fan/LANtrays. At the top of the rack housing 12, an apertured top panel 26 isprovided to permit venting of the vertically moving air flow from thesystem 10.

[0035] At the top portion of each bay 14, in the intermediate regionbetween the front bay portion 16 and the rear bay portion 18, a powerdistribution unit (PDU) 29 is provided to supply electricity to variouscomponents mounted in the rack mounted system. Each bay is adapted toaccommodate a plurality of computer components in the form of openstructure computer blades, such as blade 32, in each of the front bayportions 16 and the rear bay portions 18. In the embodiment illustratedin the figures, eleven blades may be accommodated in each of the frontbay and rear bay portions. Thus, in the illustrated embodiment, thesystem 10 accommodates 66 computer components in a densely compact,closely spaced configuration.

[0036] Referring now to FIGS. 2-4, the blades 32 and their installationinto the rack housing 12 will now be described in greater detail. Eachblade is provided with a pair of handles 54 which allow a user to easilymanipulate the blade 32 to be grasped by the user to slide the bladeinto or out of its bay. Each blade 32 may include one or more motherboards 56. In the system illustrated in FIGS. 3 and 4, each blade 32includes two mother boards 56 a, 56 b. Those skilled in the art willappreciate that the number of mother boards included in each blade 32may be varied according to design. The mother board may include heatsinks such as heat sinks 58 and 59 for facilitating the cooling of themother boards. Examples of the heat sinks are disclosed in greaterdetail in U.S. provisional application Serial No. 60/384,487, filed May31, 2002. Further, each mother board is provided with random accessmemory (RAM) 61. The amount of RAM 61 provided for each mother board maybe varied as needed. A pair of power supply 63 a, 63 b may be providedon the blade 32 for supplying power to their corresponding mother boards56 a, 56 b. Similarly, a pair of hard disks 64 a, 64 b may also beprovided on the blade 32.

[0037] All of the components are mounted on one side of a rigid plate orsupport 64, which is adapted to be supported vertically within its bay.Each blade 32 includes a cut-out corner portion or section 65 in itsupper back portion. The cut-out portion 65 is sized to receive andaccommodate the PDU 29 therebetween such that two opposing blades 32 and32 a accommodate the PDU 29 almost completely. Thus, a substantiallyzero footprint is achieved for the PDU 29. Each blade 32 is providedwith an AC power inlet such as an inlet 67 at or near the cut-outportion 65. Thus, when the blade 32 is installed into the rack housing12, the AC power inlet 67 engages electrically a corresponding ACconnector such as a connector 76 of the PDU 29.

[0038] As most clearly illustrated in FIG. 2, the installation of theblade 32 may be achieved in a fast and efficient manner. The blade 32 issimply slid into either the front bay portion 16 or the rear bay portion18 of a bay 14 of the rack housing 12. Each blade 32 is slid back untilits AC power inlet 67 engages a corresponding AC connector 76 on the PDU29. The intermediate dividers 19 serve as a back stop for the blades 32.Each blade 32 is secured in its slot by four blade screws 69, whichattach the blade 32 to the rack housing 12.

[0039] Once the blade 32 has been mounted onto the rack housing 12, ashort blade/LAN connector cable such as a cable 71 provides electricalnetworking connection between the blade 32 and a network such as a localarea network, wide area network or a public network such as theinternet. In this regard, the mother boards are each mounted at thefront of each blade, and thus access thereto is readily available atfront outlets.

[0040] Each rack system 10 may include one or more master bladeslocated, for example, in the front lowest bay, with the remainder of theblades being slave blades being at least partially controlled by orthrough the master blades.

[0041] C. Reset Control System

[0042] A system for controlling computer units such as the computercomponents or systems described above with reference to FIGS. 1-4 willnow be described in greater detail with reference to FIGS. 5-8. Thedisclosed embodiments of the system and methods enable a user to controlone or more computer units, and the control can be executed remotely, ifdesired, without physically travelling to the location of the computers,computer components or computer systems to be controlled. It is to beunderstood that the control can be executed locally as well, and can beaccomplished according to certain embodiments of the inventionsubstantially without human intervention.

[0043]FIG. 5 illustrates one embodiment of a system for controlling bymonitoring and/or resetting of computer units mounted on, for example,the rack assembly described above with reference to FIGS. 1-4 eitherremotely or locally. In this embodiment, the computer system 10 isconstructed and arranged with the blades, such as the blade 32, servingas slaves. The blades are designated in FIG. 5 with the letter “S” suchas the blade 32. Also, one of the blades serves as a master blade 103.Of course, it will be understood by those skilled in the art that thecomputer system 10 may be provided with more than one master blade 103,and any number of slave blades. In one embodiment, no master blades areprovided, and the existing blades (no longer slave blades) arecontrolled directly, rather than through the master blade.

[0044] The system 10 further includes thermal couples, such as thermalcouple 105. The thermal couples may be strategically located, forexample, within each bay of a rack system. In further embodiments, athermal couple is provided near or within each component such as ablade. Accordingly, the number of thermal couples included in the system10 may be varied as needed.

[0045] The thermal couple 105 is adapted to detect a temperature of, forexample, an environment around or within a component. In this regard,the thermal couples may detect the operating temperature of thecomputer, computer component or computer system and transmit the valueto an external recipient. Such thermal couples are well known to thoseskilled in the art.

[0046] The blades, such as slave blade 32, and the thermal couples, suchas thermal couple 105, are adapted to communicate with a reset controlmodule 107 forming a part of the control bay 21 through cables, such ascables 101 a and 105 a. In a preferred embodiment, each cable linkingthe reset control module 107 to a blade or a thermal couple is atwo-wire cable.

[0047] The reset control module 107 is adapted to receive data from eachthermal couple, such as thermal couple 105. Further, the reset controlmodule 107 is adapted to receive and send signals from and to thevarious blades, including slave blade 32 and master blade 105. In thisregard, the reset control module 107 may receive computer unitperformance signals including performance data for each blade. Forexample, the data may include indications of a malfunction andrequesting attention.

[0048] A user, such as a system administrator may communicate with thereset control module 107 using a remotely located computer 112, such asa personal computer or a workstation. It should be understood that thecomputer 112 may also be located on the same site as the computer unitsbeing monitored. The computer 112 may communicate with the reset controlmodule 107 through a network 114 of computers such as an Intranet or alocal area network (LAN). The computer 112 may be located in a differentroom, building or city from the system 100. The network 114 allowstwo-way communication between the reset control module 107 and thecomputer 112. The user may also communicate with the reset controlmodule 107 using a personal computer 116 through a public network 118such as the Internet.

[0049] In operation, the reset control module 107 receives signals fromthe thermal couples, such as thermal couple 105, through the cables,such as cable 105 a. The signals may include data relating to theoperating temperature of the computer, computer component or computersystem, such as a blade. The data is transmitted to the reset controlmodule 107 from each thermal couple. The reset control module 107 mayreceive the data at a pre-determined frequency, such as one hertz,thereby providing regular updates to the reset control module 107.

[0050] Further, the reset control module 107 may also receive signalsfrom the various computers, computer components or computer systems,such as blades 103, 101. These signals may include further data relatingto the operation of the blades, such as operating efficiency, capacity,etc.

[0051] The temperature and other operating information may be monitoredthrough the networks 114 or 118 by the user at the computers 112 or 116.If a malfunction is detected at one of the blades, for example, a signalmay be transmitted from one of the computers such as the computers 112and 116 to the reset control module 107 to shut down or reset theparticular computer, component or system. For example, a malfunction maybe detected as a temperature above a pre-determined threshold at one ofthe thermal couples. The high temperature may indicate that, forexample, a ventilation fan has failed, thereby threatening to destroy ordamage one or more components. In this scenario, either one blade or anentire bay of blades may be shut down or reset.

[0052] The reset is performed when a signal is sent from one of theremote computers 112 and 116 to the reset control module 107. The signalmay contain instructions for the reset control module 107 to shut downthe necessary components. The reset control module 107, in turn, mayitself shut down or reset the appropriate components. Alternatively, thereset control module 107 may transmit a further signal to the componentrequesting the component reset itself.

[0053] In a further embodiment, the reset control module 107 may beprovided with a pre-set threshold for resetting various components. Forexample, the reset control module 107 may be provided with a maximumtemperature detected by the thermal couples. If the thermal couplesindicate a temperature above that threshold, the reset control module107 may initiate the reset process automatically without involving theremote computers 112 and 116. In this regard, the threshold may bemodified remotely by the user using the computer 112, 116. A signal maybe transmitted from one of the computers 112 and 116 to the resetcontrol module 107 through a network such as the networks 114 and 118providing a new threshold.

[0054] D. Reset Control Hardware

[0055] Referring now to FIG. 6, the hardware design of one embodimentfor the reset control module 107 is illustrated in the form of a printedcircuit board (PCB) assembly. FIG. 6 shows the functional blocks andconnector interfaces contained on the PCB assembly for the module 107.

[0056] The disclosed embodiment of the module 121 provides control of136 power and reset outputs and monitoring of temperature input via amini web server 123. The mini web server 123 may be implemented usingthe Dallas Semiconductor TINI™ product. The control and monitoringfunctions may be provided via a Java™ Applet embedded into an html webpage. The disclosed embodiment of the module 107 includes an integralTINI™ PCB 123, communications interfaces 125, 127, and 129, and 136isolated digital outputs indicated at 132.

[0057] The power input requirements for the disclosed embodiment of thePCB assembly 121 are 7-to-30V AC or DC at 500 mA maximum. The powerinput is not isolated from the PCB logic and internal ground. The powerinput is made via a two-part header 134 that can accept #22 to #16 AWGwire and is supplied to a low voltage power supply 135.

[0058] The PCB assembly 121 is contains a watchdog timer 133 thatprovides supervision of the application software and monitors the logicpower supply. The watchdog timer 133 is used to increase the reliabilityof the system and eliminates the need for a manual restart in the eventof an unforeseen malfunction.

[0059] The disclosed embodiment of the module 107 connects to local areanetworks via a 10Base-T Ethernet interface 129 that is terminated by aRJ-45 connector (not shown). The interface 129 is generally terminatedat one of the networks hub or switch ports. The Ethernet interfaceTCP/IP settings can be made via the RS232 port 125 or a network Telnetsession. The former is used for PCB assembly's “1^(st) Birthday”configuration or when the network settings are unknown.

[0060] The disclosed embodiment of the module 107 includes a DallasSemiconductor 1-Wire™ network that is terminated by a standard RJ-11connector 127. The pin out arrangement is compatible with a variety ofthird party 1-Wire™ I/O devices that range from temperature sensors tocomplex I/O points.

[0061] The disclosed embodiment of the PCB assembly 121 contains aserial RS232 communication port 125 that is terminated by a DB9 femaleconnector (not shown). The RS232 port 125 may interface to a personalcomputer with a common null modem cable. The RS232 port 125 is used toaccess the system software for configuration purposes, and isfunctionality available for future expansion of the product.

[0062] The disclosed PCB assembly 121 contains 136 digital outputsindicated at 132 that can be remotely controlled via a web browser andTCP/IP. The output states are written to latch devices 137 via decodingand interface logic 139 by the application software. Each output channelis optically isolated by means of optical isolators 142 from the PCBassembly 121 ground or common point by at least 1500 VAC continuous.Each channel can be independently controlled via the applicationsoftware. All of the outputs go to the inactive state when the PCB (TINICPU) is in a system reset. Each channel provides a current sink outputcapable of switching up to 50 mA to a return wire at 12 VDC. The digitaloutputs are terminated to a connector and pin out arrangement (notshown). The digital output common return isolation may be arranged witheither a common or independent return path as described below withreference to FIGS. 7 and 8.

[0063]FIG. 7 illustrates schematically the optical isolators 142 and howthey are coupled to the digital outputs 132. The arrangement shown inFIG. 7 is a common return path topology. The return path for multiplechannels can share a common grounded connector pin such as a pin 152. Itis presently preferred in the disclosed embodiment that the sharing ofthe common ground should preferably be limited to six channels or fewerfor some applications. The advantage for some applications is that theconnector requires less than two pins per output channel.

[0064] The optical isolators 142 include optocouplers, such asoptocoupler 155, which have their outputs coupled through currentlimiting resistors such as a resistor 157 for the optocoupler 155, tothe digital outputs 132. The outputs 132 include a first series of pins,such as a pin 159, connected to the optocouplers 142, and a secondseries of pins such as the pin 152. The first and second series of pinsare arranged in pairs, such as the pins 159 and 152. Thus, for example,when the optocoupler 155 is activated, ground is then switched to thepin 159 to send the control signal to a given blade. It should beunderstood that only two optocouplers and four pairs of digital outputpins are illustrated for sake of simplicity.

[0065]FIG. 8 is another embodiment of an optical isolator arrangement inthe form of an independent return path topology. The advantage of thisconfiguration for some applications is that the signal commons on thetarget systems do not get interconnected.

[0066] The optical isolator 170 of FIG. 8 includes a group ofoptocouplers such as an optocoupler 172 and a group of digital outputs174 arranged in pairs of pins such as pins 176 and 178. The pin 176 isconnected through a current limiting resistor 181 to one output of theoptocoupler 172 and its other output is connected directly to the pin178. Thus, there are no common grounds.

[0067] Referring again to FIG. 6, the disclosed embodiment of the PCBassembly 121 may contain footprints for the circuitry necessary to addan additional 512K FLASH memory 138. The additional FLASH memory may beused to store larger application programs.

[0068] E. Reset Control Software Method

[0069] The software functional operation and design of one embodiment ofa printed circuit board (PCB) assembly 121 for use with the resetcontrol module 107 will now be described.

[0070] The disclosed embodiment of the PCB assembly 121 provides controlof 136 power and reset outputs and monitoring of temperature input viathe mini web server 123. The control and monitoring functions may beprovided via a Java™ Applet embedded into an html web page. The userconnects to the system using a web browser and opens the control andmonitor web page that then starts the Applet program. After Appletinitialization, a login dialog box may be displayed requiring a usernameand password. The user must successfully login before the user canaccess the Applet controls and displays.

[0071] Once successfully logged in, the user can select 1 of 68 possiblepower and reset pairs to be controlled. The user can select to sendeither a power or reset trigger. The power trigger is selected to beeither a “Power On” or a “Power Off” signal. A button is pressed toexecute the command and to transmit the signal to the appropriate poweror reset output. This signal remains active for a preset, configurabletime and automatically clears itself without user intervention. Thetemperature input reading is periodically updated on the Applet displayand displayed in degrees Celsius or Fahrenheit as configured. Aconfigurable description for the temperature input is also displayedbeside the temperature value. An alarm flag may be displayed in theevent that the temperature value exceeds a “High Alarm” set point ordrops below a “Low Alarm” set point. An email message can be configuredto be sent when an alarm is active.

[0072] The software for the RackSwitch product may be divided into twomain software sub components as follows: the I/O board, and the controlApplet.

[0073] The I/O board software component is designed to operate on theDallas Semiconductor TINI™ server 123 and may be based on the DallasSemiconductor TINI™ Operating System. The operating software is providedby Dallas Semiconductor for development on the TINI™ server 123. The I/OBoard may include 136 digital outputs and a 1-Wire™ temperature input.The digital outputs are defined as a reset or a power output.

[0074] The I/O Board software performs several functions in addition tothe functions provided by the Dallas Semiconductor TINI™ OperatingSystem. The I/O Board implements a small HTTP server used to serve thecontrol Applet and associated html file when a user request is received.It updates the power and reset outputs based on user requests and timingspecifications, and maintains a system log file. The I/O Boardperiodically reads the temperature inputs and maintains a temperaturevalue between readings. The I/O Board sends email alarms to a configuredemail address when the temperature reading exceeds a high set point ordrops below a low set point. It maintains and services user TCP/IPnetwork connections, and provides username and password login functionsfor the control Applet. The I/O Board implements system initializationand configuration, and reads and calculates current date and time basedon a real-time clock interface provided with the Dallas SemiconductorTINI™server 123.

[0075] The TINI™ Operating System may be the basis for most softwareoperation. In addition to many other features the operating systemprovides password utilities for managing user accounts, setting thecurrent date and time and configuring network settings. The operatingsystem may be modified for the reset system. During operating systeminitialization, if the password file is corrupt and/or can not be openedor found, then a default password file is automatically created thatcontains a default username and password. During operating systeminitialization, the operating system startup file is automaticallycreated when the file is corrupt and/or can not be opened or found. Thedefault startup file contains the startup line call and the defaultcommand line parameters.

[0076] Usernames and passwords provide privilege levels: administrativeand general. Administrative privilege levels allow a user unlimitedaccess to any aspect of the system as well as any TINI™ Operating Systemconfigurations and file system operations. General privilege levelpermits the user unlimited access to the system features and functions,but limited access to the TINI™ Operating System configurations andfunctions. The usernames and passwords are maintained in a standard textfile with the passwords being encrypted using the TINI™ Operating Systempassword encryption function. Utilities provided in the TINI™ OperatingSystem provide means for adding, deleting and/or modifying systemusernames and passwords.

[0077] The system log file is a standard text file that lists systemevents with a date and time stamp. System events include, but are notlimited to, system startup, user logins, Java runtime exceptions,records of user actions for “Power On,” “Power Off” and “Reset” andtemperature alarms when configured and enabled.

[0078] The temperature sensor may be based on the Dallas Semiconductor1-Wire™ Temperature Sensor, DS18S20. The temperature input may be readperiodically and may update the temperature value transmitted to allconnected users. The temperature input provides configurable parametersfor a description field to be displayed by the Applet, a unit designatorfor Fahrenheit or Celsius, “Alarm High” set point, “Alarm Low” setpoint, alarm enable, and an email alarm enable. When the “Alarm High”set point is exceeded, the High alarm flag is set if the alarm isenabled. When the temperature value is less than the “Alarm Low” setpoint then Low alarm flag is set if the alarm is enabled. If eitheralarm is active and the email alarm enable is set then the email alarmflag is set.

[0079] The outputs are classified as a reset or a power output. Eachoutput can be activated for a configured amount of time, for example,from one to thirty seconds. Times configured less than 1 second areautomatically set to 1 second and configured times greater than 30seconds are automatically set to 30 seconds so that the range of timevalues is always valid. The time configuration is a global configurationfor all power and reset outputs and may not be individuallyconfigurable.

[0080] The power and reset are paired together and the user can performactions on a single power and reset pair or all power and reset pairs.

[0081] Email alarms for the temperature input will be automatically sentfor active alarms. Configurable email alarm parameters are: Delay Time(seconds), Repeat Count, Mail To Address. The Delay Time is the amountof time to delay after an alarm has gone active before sending the emailalarm. The Repeat Count is the number of times the email alarm is sentwhile the alarm is active. The Mail To address is the email address ofthe recipient for the email alarm. When an alarm is activated the emailalarm function waits the Delay Time and then sends the email to theconfigured recipient. If the Repeat Count is greater than or equal to 1then another email alarm will be sent after the Delay Time expiresagain. This will repeat for the Repeat Count times. If the email alarmwould become inactive anytime during this process then the alarm activestate is cleared and the email alarm would cease and reinitialize. Ifthe email alarm would become active again the process would start overagain.

[0082] A mail host is configured in the TINI™ Operating System IPConfiguration using the provided configuration command.

[0083] The configurable parameters are configured through a standardtext configuration file. Parameters are typically comma delimited. Theconfiguration parameters are read once during initialization andstartup. The configuration file is modifiable using a common text editorprogram and may be transferred to the TINI™ file system using a FTPutility. The IP port number may not be configurable in the standard textconfiguration file, but may be configured in the TINI™ Operating Systemstartup file which is a standard text file edited and transferred usinga common text editor and FTP utility.

[0084] The Dallas Semiconductor TINI™ server 123 provides a Real-TimeClock function. The date and time are set using the TINI™ OperatingSystem configuration command provided. The I/O board periodically readsthe Real-Time Clock and converts the reading to the current date andtime.

[0085] The system communicates using two IP port numbers. One IP port isthe standard HTTP port, port 80. The second port is configurable and canbe any valid IP port number. The default communication IP port number isport 1025. The second IP port is used for data communications betweenthe control Applet and the I/O board. Periodic data messages aretransmitted to each connected user. This periodic communicationmaintains the connection status for each Applet to the I/O Board andcontains the date and time data.

[0086] The software component implements a Java™ Applet that providesthe user graphical interface for controlling and monitoring the powerand reset outputs and the temperature input. After initialization, theApplet opens a TCP/IP socket connection to the I/O board using aconfigurable IP port number, establishes and maintains the networkconnection during operation. Once the network connection is establishedand the Applet initialization completed, a user login box displays,requiring a valid username and password. After successful login, theApplet controls and displays become visible.

[0087] The Applet provides a list box from which 1 of 68 power and resetoutput pairs are selected for operation. In addition to the 1 to 68power and reset output pairs in the list box, an “All” selection isprovided to select all outputs for the selected action. The user thenselects the action to be performed: “Power On,” “Power Off” or “Reset.”The default action is “Reset.” Once the selection has been completed theuser activates a control button to complete the action. Then the poweror reset output for the selected pair(s) is activated.

[0088] The Applet displays the current temperature reading with aconfigurable label in the configured units either Fahrenheit or Celsius.Fahrenheit units will be the default display. In addition to thetemperature input display the current set date and time will bedisplayed. The time will be displayed with a precision to one second andis continuously updated while a network connection is established withthe I/O board.

[0089] If the network communications fails, the Applet displays an errorbanner indicating this failure.

[0090] Thus, a remote user may monitor and shut down or reset acomputer, computer component or computer system without physicallytravelling to the location of the computer, computer component orcomputer system.

[0091] While particular embodiments of the present invention have beendisclosed, it is to be understood that various different modificationsand combinations are possible and are contemplated within the truespirit and scope of the invention. There is no intention, therefore, oflimitations of the appended claims to the exact disclosure pr abstractherein presented.

What is claimed is:
 1. A method of controlling a group of computer unitsmounted on a rack, comprising: receiving a computer unit performancesignals at a reset control module mounted on the rack; generating acontrol signal in response to the receipt of the performance signal fora given control unit; and sending the control signal to the given one ofthe computer units for causing it to be controlled in response thereto.2. A method according to claim 1, wherein said control signal is a resetsignal for resetting the given one of the computer units.
 3. A methodaccording to claim 2, wherein said control signal is indicative ofeither controlling the power to the given one of the computer units orrequesting it to trigger a reset mode of operation.
 4. A methodaccording to claim 1, wherein said receiving computer unit performancesignals are received from a sensing device mounted on the rack fordetecting malfunctions.
 5. A method according to claim 4, furtherincluding determining whether the condition is within pre-determinedlimits.
 6. A method according to claim 5, wherein said control signal isa reset signal generated when the condition is determined to be outsideof the predetermined limits.
 7. A method according to claim 4, whereinsaid sensing device is a temperature sensing device.
 8. A methodaccording to claim 5, further including sending an alarm message to aremote computer to indicate that a malfunction has occurred when it isdetermined that the condition is outside said limits.
 9. A methodaccording to claim 8, wherein said message is sent after a predeterminedtime delay following the determination that the condition is outsidesaid limits.
 10. A method according to claim 9, further includingrepeating the sending of the alarm message after another time delayinterval.